Self healing distributed systems

Author

  • Benjamin Satzger
Abstract

The growing complexity of distributed systems demands new ways of control. This work addresses self-healing in distributed environments. The term self-healing denotes a fairly new area of research and is used quite broadly, but it can be seen as dynamic fault tolerance. This work proposes generic concepts and algorithms for building self-healing systems. Detecting node failures in distributed environments is a non-trivial problem, and failure detectors are an important component of many fault-tolerant distributed systems. A new failure detection algorithm is proposed whose noteworthy features include high flexibility and good performance. Furthermore, an approach is presented to reduce the message overhead of failure detectors. New grouping algorithms are introduced to enable scalable self-monitoring, which allows monitoring relations to be installed autonomously in complex, large-scale distributed systems. A failure recovery engine based on automated planning, which manages a distributed system according to user-defined objectives, is proposed. It is able to generate and execute plans to autonomously recover a system from unwanted states. Finally, ideas for a generic self-healing architecture for highly complex distributed systems are presented. The design is based on psychological and sociological concepts.

Similar resources

Self-Healing Systems: Foundations and Challenges

The term and characteristic of self-healing, applied to systems, is often seen from different fields of computer science, such as fault tolerance or network and service management, with diverging semantics. Since this impression was confirmed also during the first discussions of the Dagstuhl seminar on “Self-Healing and Self-Adaptive Systems”, a seminar’s working group on “Terminology” was form...

Biology-inspired self-healing system design

When engineering self-healing systems, inspiration can be sought from nature. Biological organisms present proven self-healing models that have been developed during millions of years of evolution. In this paper common self-healing properties found in nature are described. A programming paradigm inspired by properties of the biological cell is discussed, and a software architecture for distribu...

Self-healing in payment switches with a focus on failure detection using State Machine-based approaches

Composition, change and complexity have attracted everyone’s attention towards Self-Adaptive systems. These systems, inspired by the human body, are capable of adapting to changes in the inner and outer environment. The main objective of this study is to achieve a more convenient availability for e-banking services in the payment switch, using self-healing systems and focusing on the failur...

Robust Agent Based Distribution System Restoration with Uncertainty in Loads in Smart Grids

This paper presents a comprehensive robust distributed intelligent control for optimum self-healing activities in smart distribution systems considering the uncertainty in loads. The presented agent based framework obviates the requirements for a central control method and improves the reliability of the self-healing mechanism. Agents possess three characteristics including local views, decentr...

Publication date: 2008